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(~| ' Abstract 



We find exact and asymptotic formulas for the average values of several statistics 
on set partitions: of Carlitz's (/-Stirling distributions, of the numbers of crossings in 
linear and circular representations of set partitions, of the numbers of overlappings 
and embracings, and of the numbers of occurrences of a 2-pattern. 



oo : 1 Introduction 

^ ■ Among all basic combinatorial structures, set partitions are probably one of the most 

^r^ ■ attractive. Like their close cousin, the permutations, they have a remarkably rich combi- 

natorial structure. This is partly due to the fact that they can be represented in many 
equivalent way: in terms of words, functions, graphs, placements of non-attacking rooks 
^ I in a triangular diagram, line diagrams, etc. (see Figure [1] for some examples). During 

the past decades several statistics (here, statistic means an integer valued combinatorial 
parameter), also called parameters, on these different representations of set partitions 
have been introduced and studied. For instance, natural statistics on set partitions are 
the number of inversions in restricted growth functions or the numbers of crossings of arcs 
in linear and circular representations of set partitions. In very rare cases, one can deter- 
mine the exact distribution of the statistic (i.e., for each n, the number of set partitions 
of {1,2, . . . ,n} which have a certain value of the considered statistic), but most often, 
even though one has good information on the parameter, such as, for example, in form 
of an explicit formula for the generating function, there is no exact expression for the 
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(a) linear representation 
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(c) rook placement 
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(d) line diagram 



Figure 1: Some representations of tiie set partition vr = 1 10/2 3 7 9/4/5 6 12/8 11 

distribution. The by far most important descriptive quantity associated to a statistic is 
its average value (also called mean value or expectation). The purpose of this paper is to 
compute the average values of several statistics on set partitions. Some of these statistics 
have been introduced in the literature previously and others are new. 

2 Background and main results 

2.1 Basic facts 

The central objects of this paper are set partitions. A partition vr of a set 5* is a collection 
of nonempty and mutually disjoint sets n = {Ai, A2, . . . , A^} whose union is S. The 
sets Ai are called the blocks of the partition, and a partition into k blocks is called a 
k-partition. Let us denote by Il{S) the family of all partitions of S and by n'^(S') the 
family of all /^-partitions of S. For instance, n^({a, b, c, d}) consists of the elements: 

{{a},{b,c,d}} {{b},{a,c,d}} {{c},{a,b,d}} {{d},{a,b,c}} 
{{a,b},{c,d}} {{a,c},{b,d}} {{a,d},{b,c}} 

The number Sn,k oi /^-partitions of an n-set is called Stirling number of the second kind, 
while the number i?„ of all partitions of an n-set is called Bell number. Hence, by defini- 
tion, we have 5„ = Yll=i Sn,k- 
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Almost all statistics considered on set partitions require that the set S to be par- 
titioned is linearly ordered. Without loss of generality, from now on, we will con- 
sider only partitions of subsets of the set P = {1,2,...} of positive integers. With 
[n] := {l,2,...,n}, we will set n„ := n([n]), H^ := U''{[n]) and U := Un>i Hn- A con- 
venient way, called standard form (also called canonical order), to write a set partition 
TT G n(S'), 5* C P, is to arrange the blocks of tt in increasing order of their minima and each 
block in increasing order. For instance, the partition tt of [9] consisting of the five blocks 
{1, 4, 7}, {3, 9}, {2}, {5} and {6, 8} is written in standard form as vr = 1 4 7/2/3 9/5/6 8. 

A substantial number of combinatorial statistics on set partitions were defined in 
terms of the restricted growth function representation for set partitions. It is well-known 
that k\Sn,k is the number of surjections from an n-set to a k-set. Actually, Stirling numbers 
also count a class of functions, called restricted growth functions. A word (or equivalently, 
a mapping) w = wiW2- ■ ■ Wn is said to be a restricted growth function (RGF for short) 
of length n and size k if: 

1. Wi = 1, 

2. Wi+i < max{wi, W2, ■ ■ ■ , Wi} -|- 1 for 1 < ? < n — 1, 

3. max{wi, W2, ■ ■ ■ , Wn} = k. 

The usual way to associate bijectively set partitions of [n] into k blocks and RGF of 
length n and size k works as follows: to a partition tt = B1/B2/ ■ ■ ■ / Bk of [n] written in 
standard form, we associate the RGF w(7r) = W1W2 ■ ■ -Wn such that Wi = j if and only 
if z G Bj. For instance, under this correspondence, the partition vr = 1 4 7/2/3 9/5/6 8 
is sent to the RGF w{7t) = 12314515 3. The reader who knows the plethora of per- 
mutation and word statistics can imagine the amount of statistics that can be defined 
on set partitions via restricted growth functions. For instance, several statistics were de- 
fined in terms of inversions and noninversions, and more generally, in terms of counting 
occurrences of patterns in restricted growth functions. 

Other interesting representations of set partitions suggest other combinatorial param- 
eters. We will present some of them when they are needed. 

2.2 Z-statistics and main results 

Computation of average values of combinatorial parameters is often entertaining, but can 
be a very difficult task in some cases (for instance, consider the parameter "length of the 
longest increasing subsequence" in permutations). In this paper, we will consider some 
statistics whose computation of their (at least exact) average value is not trivial, but 
not as difficult as the one we mentioned previously. For instance, two of the parame- 
ters considered in the present paper are the numbers of crossings in linear and circular 
representations of set partitions (see Figured]). A little moment's thought will convince 
the reader that the computation of their (at least exact) average value is not completely 
trivial. 
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Depending on which statistic on set partitions is considered, it is more or less difficult 
to find its average value. As an example, let us consider the statistics 'number of blocks' 
and 'number of crossings'. While a simple direct combinatorial argument gives the average 
number of blocks in a random set partition of [n] (it is equal to -^^ — 1), a moment's 
thought will convince the reader that the computation of the average number of crossings 
in the circular representation of a random set partition of [n] , which, as we shall see later, 
is actually equal to 

nBn+i 3n n(4n+l)5„_i [■ri\Bn-2 , fn\Bn-4: 



2 Bn 2 2 Bn \2J Bn \4:J B 

is not completely trivial. 

Perhaps the most natural and powerful approach to deal with average values is to 
use the linearity of the expectation by finding "nice decompositions" of the statistics we 
consider. This was done in the author's thesis [20] for the average number of crossings in 
the linear representation of a random set partition. It turns out that the decomposition 
used there is shared by a large number of existing statistics on set partitions and that their 
average values can be obtained almost "mechanically". In order to make these remarks 
more precise, we need additional terminology. 

Given a subset S* C P with cardinality \S\ = n, the standardization map sts is the 
(unique) order-preserving bijection sts : 5 — )■ [n]. When the subset S is clear from the 
context, we will drop the subscript. We let sts act element-wise on objects built using 5* as 
label. For instance, the set partition vr = 2 9/4 10/5/7 11/8 oi S = {2, 4, 5, 7, 8, 9, 10, 11} 
is sent after standardization to the set partition st(7r) = 1 6/2 7/3/4 8/5. 

Definition 2.1. A set partition statistic STAT is said to be a Z-statistic if for any tt G 11, 

STAT{7r)= J2 STAT{st{{A,B})). (2.1) 

The terminology is inspired from a paper of Galovich and White [H], but originally, 
the term Z-statistic is for a famous statistic introduced by Bressoud and Zeilberger [2]. It 
is worth noting that via the RGF representation, our Z-statistics on set partitions could 
be seen as Z-statistics on words (with the definition of Galovitch and White). 

For the present paper, the interest of Z-statistics on set partitions is that the com- 
putation of their average value in a random partition is essentially equivalent to the 
computation of their average value in a random 2-partition (see Theorem 13. ip . the latter 
being often easier to calculate. 

An important number of existing statistics on set partitions are in fact Z-statistics. 
In this paper, we will compute average values of Carlitz's g-Stirling distributions, the 
numbers of crossings in linear and circular representations, overlappings and embracings, 
and the number of occurrences of a 2-pattern in restricted growth functions. 
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2.2.1 Carlitz's g-Stirling distributions 

The most common g-analogs of Stirling numbers of the second kind found in the literature 
are probably the g-Stirling numbers Sn,k{(l) and Sn,k{,(l) essentially described by Carlitz 
and Gould |3l |T6] . They can be defined as follows: 

SnAl) = l''' '5„_i,,_i(g) + [k]^ 5„_i,fc(g) (n > A; > 1), (2.2) 

and Sn,k{<l) = l'^^' Sn,k{q), with Sn,k{q) = Sn,k{<l) = Snk ii n = or k = 0. 

Our interest in these g-Stirling numbers comes from the fact that they arise as gener- 
ating functions for the distribution of several statistics on set partitions. Such statistics 
have been given by Milne \27], Garsia and Remmel [T3], Sagan [30], Wachs and White [5^ . 
White [35], Leroux |25], de Medicis and Leroux [7], Ehrenborg and Readdy [9], Ksavrelof 
and Zeng [21]. This list is probably not exhaustive. It is natural to ask for the expectation 
of these "g-Stirling distributions" . 

We will say that a parameter stat defined on set partitions has the g-Stirling distri- 
bution S{q) {S{q), respectively) if its generating function on each 11^ is given by Sn,k{q) 
{Sn,k{q), respectively), i.e., for any n > A; > 1, we have 

Y^ q'""'^^^ = SrrM (= SnAq), respectively). 

Trenfc 

Theorem 2.2. Let stat be a parameter with the q-Stirling distribution S{q). Then the 
average value fin of stat in a random set partition of [n] satisfies 

lBn+2 , fn 1\ Bn+i f n I 



'n 
i2 



"— 4^n2 + 4H7n-2 + 4l t^'^' 



" ^+ '°f'°'^" (l + o(l))| asn^oo. (2.4) 



2 log n \ log n 

The average value fin,k of stat in a random set partition of [n] into k blocks satisfies 

/in,. = ^n(A;-l)-iQ+i(n + l-A:)^ (2.5) 

= lri{k~l)-^(^^+o{l) asn^oo. (2.6) 

Note that it is almost immediate to derive from the above results the expectations of 
the g-Stirling distribution S{q), which we denoted by /i„, and fLn,k, respectively. Indeed, 

by the relation Sn,kiq) — q~^^' Sn,k{q)i ^^ have 



" fc=l ^ ^ 
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2 ' ' 2\2J 2^ ' S, 

3 5„+2 fnl\ Bn+i (n 1 
^^n = --,—Fy \- 77 + 7 



and thus 

K,k = ^n{k -l)-7T(r,) +0(^^ + 1"^)" ^'''""^ 

n,fc 

A Bn ' \2 ' AJ En \2 ■ 4 

where the last equahty follows from the identity X]fc=i ^(^ ~ ^)Sn,k = -B„+2 — S-B^i+i + B^ 
(for a proof, see e.g. [2S])- 

Almost all known statistics with the g-Stirling distribution S{q) are in fact Z-statistics. 
In order to prove Theorem 12. 2^ we will work with the first known parameter with the 
distribution S{q): this is the statistic \os (originally denoted J"^), first introduced by 
Milne (see [271 Section 4]) to answer a question of Garsia. The parameter \os can also be 
defined by using the non-attacking rook placement model for set partitions (see e.g. j34j). 

It is not hard to see that \os can be defined for a set partition vr = Bi/ B2/ ■ ■ ■ /B^ 
written in standard form by 

los(7r) = 1^21 + 2\B^\ + . . . + (A; - l)\Bk\. 

It is easy to see that the parameter \os is a Z-statistic. Indeed, for a 2-partition Bi/ B2 
written in standard form, we have \os{Bi/ B2) = I-B2I whence for vr = B1/B2/ ■ ■ ■ /Bk we 
have 

k 

J2 STAT{st{B,/B,))= Y, \B,\ = Y,{j-l)\B,\=los{rr). 

^<i<j<k i<i<j<k j=2 

2.2.2 Number of linear crossings 

One of the most widely occurring property of set partitions is the "noncrossing" property. 
This is essentially due to the important number of works on noncrossing partitions which 
appear in different fields of mathematics. The terminology originated probably from the 
graphical representation of set partitions (see e.g. 133|). The linear representation of a 
partition of a set S* C P is obtained as follows: draw the elements of S in increasing order 
on a line and join successive elements of each block by arcs drawn in the upper half-plane. 

Alternatively, the elements may be represented on a circle and circularly successive ele- 
ments of each block are joined by chords: this is the circular representation. For instance, 
the linear and circular representations of the partition vr = 1 10/2 3 7 9/ A/5 6 12/8 11 are 
given in Figure [TJ Linear and circular representations of set partitions suggest many 
statistics. The most studied is the number of crossings, but other statistics like nestings, 
alignments, fc-distant crossings, fc-crossings, fc-nestings, etc. have also appeared in the 
literature (see e.g. [il [61122]). 

For a set partition vr G 11, we will denote by cr^^^Tr) the number of crossings in the 
linear representation of vr. We will refer to such crossings as linear crossings, in order 
to distinguish these from the circular case. Formally, a linear crossing is a sequence 
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(ii, j'l), {12,32) of edges of vr, such that ii < 12 < ji < J2- Here, an edge of vr is just a 
pair {i,j) of consecutive elements i < j in the same block B, where consecutive means 
that there is no element p & B such that i < p < j. For instance, if vr is the set partition 
represented in Figured! then cr^^^ir) = 4. In order to show that cr^^^ is a Z-statistic, it 
suffices essentially to observe that a linear crossing is composed of two edges from distinct 
blocks (details are left to the reader). 

Theorem 2.3. The average number fin of linear crossings in a random set partition of[n] 
satisfies 

"' ^+ '°f'°g" (1 + 0(1))) «.„^oo, (2,8) 



21ogn \ logn 

The average number fin,k of linear crossings in a random set partition of [n] into k blocks 
satisfies 

/..,. = ^n(A:-l)-^Q)+^(n + l-A:)^ (2.9) 

= ^"(^-l)-^(2)+''^^^ asn->oo. (2.10) 

2.2.3 Number of circular crossings 

For a set partition vr G II, we will denote by cr^'^^n) the number of crossings in the 
circular representation of vr. We will refer to such crossings as circular crossings. For- 
mally, a circular crossing is a sequence (ii, ji), {12, J2) of "circular edges" of vr, such that 
ii < 12 < ji < 32- Here, a circular edge of vr is a pair (i, j) such that i, j are consecutive 
elements in the same block of vr or z = min(i?) and j = max(i?) for some block 5 e vr. 
For instance, if vr is the set partition represented in Figured! then cr('^^(vr) = 9. We leave 
the verification that cr^^^ is a Z-statistic to the reader. 

Theorem 2.4. The average number fin of circular crossings in a random set partition 
of [n] satisfies 

_ n Bn+i , 3n n(4n + 1) Bn-i (n\ Bn-2 , (n\ E„_4 



2 Bn 2 2 B„ \2J Bn \4/ _B„ 

= -^fl + i^^ (1 + 0(1))') as n^ 00. (2.12) 

2 logn \ logn y 

The average number fin,k of circular crossings in a random set partition of[n] into k blocks 
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satisfies 



l__/i. i\ ^/A.^ r7_ , 1 N *S'n-l,fc-l T ^ / ^ \ 'S'„_2,A;-2 



/.„,, = -n(fc- 1) - -n{4n-5k + l)^^-^^ - 10 _ ^^^^ (2.13) 

, n\ Sn-4:,k-2 



4:J Sn,k 

= -n{k - I) + o{l) asn^oo. (2.14) 

2.2.4 Number of overlappings 

Flajolet and Schott [I2] considered a special class of set partitions, called non- overlapping 
partitions. With the implied order structure, two sets B and B' are said to overlap if 
niin(i?) < niin(i?') < niax(i?) < niax(i?'). Then, a partition is said to be non- overlapping 
if any two blocks do not overlap. This property of non-overlapping could be seen graph- 
ically by considering the line diagram (see Figure [1]) of a set partition. This suggests to 
consider statistics like the numbers of overlappings, embracings ("nestings of blocks"), 
etc. 

The number of overlappings of a set partition vr will be denoted of/(7r). For instance, 
if TT is the set partition represented in Figure [H then of /(vr) = 4. We leave the verification 
that ovl is a Z-statistic to the reader. 

Theorem 2.5. The average number fin of overlappings in a random set partition of [n] 
satisfies 

= lf^yfl + 2!2ii^(l+o(l))) asn^o.. (2.16) 

4 \lognJ \ logn J 

The average number fin,k of overlappings in a random set partition of [n] into k blocks 
satisfies 



^^n,k = ^(: +nik-l)^^^^-^^-:-in + k-l):^^ (2.17) 



1/k 

2 V2 



+ o(l) asn-^oo. (2.18) 



2.2.5 Number of embracings 

In this section, we consider a natural partner for overlappings. Say that two sets B 
and B' embrace if, with the implied order structure, min(i?) < min(i?') < max(i?') < 
max(i?). The number of embracings of a set partition vr will be denoted emb{7r) and 
can be easily computed via the line diagram representation. For instance, if vr is the set 
partition represented in Figured! then em6(7r) = 4. We leave the verification that emb is a 
Z-statistic to the reader. 
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Theorem 2.6. The average number fin of embracings in a random set partition of [n] 
satisfies 

1 Bn+2 5 i?„+i 3 n i?„„i 

"» = 4-B7 - i-^ -^ i + i-i7 •'■''" 

1/ " ^Vl + 2 '°f'°g" (l+o(l))'| a.„^oo. (2.20) 



4 \\ognJ \ logn 

The average number fin,k of embracings in a random set partition of [n] into k blocks 
satisfies 

1 /'k\ 1 ,, ^ S„ i._i n Sn-i k-2 

f^n,k = ^ U - 9 (^ - ^)^ + 9 ^V"^ (2.21) 

= ^L)+^(^) asn^oo. (2.22) 

2.2.6 Number of occurrences of a 2-pattern 

We say that a word a = o"iO"2 . . . cr^ in {1, 2}* is a 2-pattern if it contains the two letters 1 
and 2. An occurrence of the 2-pattern cr in a word w = WiW2 ■ ■ ■ Wn G P* is an r-tuple 
(zi,i2, . . ■ ,ir) such that st{wi^Wi^ . . -Wi^) = a, where st is the standardization map (see 
Section Q. For instance, in the word u; = 243 1 226 73 7, the 3-tuples (1,3,9), (4,5,6) 
and (7, 8, 10) are occurrences of the pattern 12 2, while the 3-tuples (2, 3, 9) and (3, 5, 6) 
are occurrences of the pattern 211. The number of occurrences of a pattern a in a word w 
will be denoted occ„{w). Since a set partition can be identified with its restricted growth 
function, we will set ocCo-(7r) = occ„{w{7!-)), where w{7i) is, as usual, the RGF associated 
to vr. We leave the verification that occ^ is a Z-statistic for any 2-pattern a to the reader. 

Theorem 2.7. Let a = criO"2 . . . cr^ be a 2-pattern of length r, and let jin denote the 
average number of occurrences of the pattern a in a random set partition of [n]. 

(i) For fixed n, the average value fin depends only on the first letter of a. 

For i E {1, 2}, denote by fin the common value of fin for patterns r (of length r) 
satisfying Ti = i. 

(a) For any integer n > 1, we have 

/^i-^^^ + /^i-^^^ = h ''-^^-^-''-^^-'\ (2.23) 



For j = 0, . . . , r, set 



^^("'^»(G)nG!,i)^ 
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(in) For any integer n > 0, we have 



.lr-^Ep»^-K:) 



2\rJ B„ 2-+1 

3=0 

Bn+i-i 1 / fn\ 1 / n \\ Br, 



!.-'^-EM")^-5 : -ii-Jrf" 



i=o 

1 fn\ Bn+l^r . (-1 



2\rJ Bn 2'-+! 

('zf^ For any 2-pattern u of length r, we have the asymptotic approximation 

(a) _ J_ 2/, ^r-2 ( , /„ c.^^Og\ogn , /'loglogU 



^r = Tr7^'(log<"Ml-(^-2)^^^ + o^^P^ , n^oo. (2.25) 
2r! \ logn \ logn // 

(v) For the 2-patterns a of length 2, i.e., o G {12,21}, we have the more precise 
asymptotic approximation 

(„\ V? ( 1 log logn , , ,,\ , , 

'"^ =T^-r^-7^^^ 1 + ^W , n^oo. 2.26 

4 \ logn (logn) / 

To illustrate our result, we give the values of /in for 2-patterns a of length r < 3. For 
patterns of length r = 2, we have 

,(21) _ 1 -B„+2 (^.\\ Bn+l , f!^_}_\ 1 fn\ Bn-l 

^"^ ~S B^ U Sy) 5„ +U Sj 2^2^ 5. ^''''^ 

(,r,-. 1 Br,+o (n \\ B„,-i /n^ n l\ 1 /n\ i?„_i 



8 Br, V4 8/5„ V4 2 8/ 212/ 5, 



-"71 



For patterns of length r = 3, we have 

,.(211) _ ,.(212) _ (221) ^_Bn+2 , 1 /^ , l^ ^ri+l _\( 2 _\ 

/^n -^^n -/^n " ^g^^ ^SV 2;* 5„ SV 2 

+ T^^'n ^ — - (^ — 1 



12 y 11 B„ 2 \o J B 

;,ai2) ^ a21) ^ a22) ^ ]_Bn+2 _ }, f ^ ^ i) ^^ ^ }, f ^2 _ }, 

^'' '^^ '^^ 16 Br, 8 V 2/ 5„ 8 V 2 

^ 1 , ,,/ 7\ 5„_i l/n\5„_2 

H n(n — 1) n . 

12 ^ ^V 2; 5„ 2V3; 5„ 

Theorem 2.8. Let a = aia2 . . . ar be a 2-pattern of length r, and let fi^l denote the av- 
erage number of occurrences of the pattern a in a random set partition of [n] into k blocks. 
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(i) For fixed n and k, the average value jj,"], depends only on the first letter of a. 

For i E {1,2}, denote by lJi^k~^ ^he common value of fx^l for patterns r (of length r) 
satisfying ti = i. 

(a) For any integer n > k > 1, we have 

i^'T' + i4:r'' = h ^""^'^""^"'"' - (2.29) 

(Hi) For any integer n> k > 1, we have 



, , Ort.-V-1 — i.k ^ I n,\ Or,-4-l_rA- ( — 1) "Jn 

[n) 



(fTi=2) ^ V^ / N '3n+2-j,fc 1 / g \ On+l-r-,fc \-^) Jn,k-2 

._„ »-'n,fc 2 \T J On,k 2 On,k 



(cri=l) _ V^ / X Sn+2-j,k 1 / flT'] 1 / IT- \\ ^i 



'n+2-r,fc 



^"■^ " ^/'^^ S^,, ^2\\rJ 2\r-lJJ S„,, 

1 ff^\ Sn+l-r,k {~^y Sn,k-2 



2 V'"/ '^n,k 2 »-'n,fc 

where pj{n) is given by (I2.24p . 

('zi'j 74s n —> oo, we have the following asymptotic approximations 



f^lT' = i:PAn)k'-^ -l(l)k'-' + oil) (2.30) 

i=o ^ ^ 



/n particular, for any 2-pattern a of length r , we have 



/^n,fc ~ o^r-l.l (^ - 1) «^ ^ ^ °«- 



2A;^-ir 



2.3 Plan of the paper 

In the next section, we will prove the key result of the paper (Theorem 13. ip . which 
expresses average values of Z-statistics as coefficients in certain generating functions. 
Moreover, there we introduce the tools which are necessary to use our key result. The 
purpose of the other sections (except the last) is to prove the results presented in Section 2. 
Finally, we will conclude the paper with some remarks, notably that the methodology 
could be adapted to other exponential families and to more general statistics. 
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3 Average value of a Z-statistic in a random set par- 
tition 

We said in the previous section that the computation of the average value of a Z-statistic 
in a random set partition is essentially equivalent to the computation of its average value 
in a random 2-partition. Let us make this more precise. Given a Z-statistic stat, let 
^stat ._ ^^^j_^2 stat{Tr), and let V"2'^*"*(a;) be the exponential generating function of the 

sequence (f^*2*) --' i-^-^ 



i>0' 



vrix) = E <",* ^ = E E ^^«^(-) ^- (3-1 



n\ ^ — ' ^ — ' n\ 

n>0 n>0 TTgffi 



The following theorem is the key result of the paper. 

Theorem 3.1. Let stat be a Z-statistic. Denote by fin (f^n,k, respectively) the average 
value of the statistic stat in a random set partition of[n\ (respectively of[n\ into k blocks). 
Then we have 



t^n,k 



1 

Bn 

1 



s. 



n,k 



n! 



V^'''\x)B{x) (3.2) 

Vt^\x)S,.2{x), (3.3) 



where Bn and Sn,k are Bell and Stirling numbers, and B{x) and the Sj{x) 's are exponential 
generating functions of Bell and Stirling numbers defined in (13. 6p and (13.81) . 

After coefficient extraction, Theorem 13.11 can lead to exact and asymptotic formulas 
for average values of Z-statistics. For this purpose, we need some tools that we develop 
in the next subsection. It is worth noting that, here, by an exact formula, we mean 
a finite expression which involves elementary functions, and Bell and Stirling numbers. 
We then present a process to get the average and asymptotic value of Z-statistics. In a 
particular case (which includes all the statistics we consider in this paper), the algorithm 
is deterministic. We will illustrate the process in the third subsection. Finally, we will 
prove our key result. Theorem 13.11 in the last subsection. 

3.1 Basic tools 

3.1.1 Basic properties of Stirling and Bell numbers 

Recall that the number of /c-partitions of an n-set is the Stirling number of the second 
kind Sn,k, and the number of all partitions of an n-set is the Bell number Bn, so that we 
have Bn = Ym:=i ^n.k- Classifying the fc-partitions of an n-set as to whether they do or 
do not contain a given element yields the recurrence: 

Sn,k = Sn^i,k-i + kSn-i,k {n > k > 1) , (3.4) 
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with Sj^Q = Sqj = Sjfi. 

As noted previously, the number of surjections from an n-set A^ to a fc-set K is equal 
to k\ Sn^k- Then, given a set X, \X\ = x, classifying all the maps f : N -^ X according 
to the cardinality of their image leads to the fundamental identity 

" fx\ " 

2:" = ^ f , j k\Sn,k = ^ Sn,k{x)k, 
k=0 ^ ^ A:=0 

from which, by binomial inversion, we deduce the summation formula 

^-^ = ^E(-i)^(-)(^-^r- (3.5) 

Here and in the rest of the paper, {x)k stands for the k-th lower factorial x{x — 1) ■ ■ ■ (x — 
k + 1). Equation (13. 5p leads to the generating function identity 

5.(t):=$:5„,.- = -(e*-l)\ (3.6) 

n>0 

from which we deduce the double generating function 

j-n , , 

F(t,«) := V S„V^ = y2S,it)u' = e<''~'). (3.7) 

n>k>0 k>0 

Setting M = 1, we recover the exponential generating function of Bell numbers 

5(t):=Vs„- = e'^'-\ (3.8) 

^-^ nl 

n>0 

There are of course many ways to recover all the previous identities. For instance, one 
could first obtain the double generating function (13. 7p immediately from the exponential 
formula (a set partition is a "set of nonempty sets" in the language of species), and then 
by specializations, coefficient extractions, etc. recover all the previous results and even 
much more. 

3.1.2 Coefficient extraction 

Given a Z-statistic stat, the first step we have to perform in order to be able to apply 
Theorem 13.11 is to try to find a convenient expression for the power series V.^^""^ (x) . It 
turns out that for all Z-statistics which the author was able to locate in the literature, 
the formal power series V2^°'^{x) is given by 

Vi'^\x) = Po{x) + Pi{x)e' + P2(a;)e2^ (3.9) 
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where the Pi are polynomials. Therefore, we need to develop some tools to cope with this 
situation. Elementary computations based on the formulas (13. 8p and f l3.6p show that we 
have 



e"5(x) = B\x), e'"5(a;) = B"{x) - B\x) 

e^5fc_2(x) = Sk-2{x) + {k- l)5,_i(x) 

e2"5fc„2(x) = Sk^2ix) + 2{k - l)5,_i(x) + kik - l)S,ix). 



(3.10) 
(3.11) 
(3.12) 



This implies that, for V2*"-^{x) satisfying (13.91) . there exist polynomials Qi and -Rj, < i < 2, 
such that 



Vf''\x)B{x) = Qo{x)B{x) + Qi{x)B\x) + Q2{x)B"{x) 
Vt\x)Sk-2{x) = Ro{x)Sk-2{x) + Ri{x)Sk-i{x) + R2{x)Sk{x). 



(3.13) 
(3.14) 

Suppose we are given a formal power series A{x) = ^„>o CLn^ and let A^^\x) be the h-th 
formal derivative A^'^\x) of A(x). Using the basic facts 



X 



n\ 



^W( 



x] 



O-n+h 



X 



n\ 



x'A{x) = {n)ian- 



(3.15) 



coefficient extraction from power series of the form (I3.13P or (I3.14p becomes a routine 
process. In particular, if V2^'^^{x) satisfies (13. 9p . it follows by Theorem 13.11 and (I3.13p - 
(I3.15P that Hn and Hn,k are given by 



A^n 



X^PiH^-^ and /i„,fc 



B„ 






%JW 



S^. 



n—i,k—j 



(3.16) 



n,k 



where the Pi{n) and qij{n) are polynomials in n which are all zero except for a finite 
number of them. 



3.1.3 Asymptotics 

In order to obtain asymptotic approximations of the average values /i„ and fin,k, in view 
of (I3.16p . it is important to consider quotients of Bell numbers and Stirling numbers. We 
begin with Stirling numbers. 

It follows immediately from (13. 5 p that, as n — t- oo, we have 






from which we deduce that, for i,j > 0, we have 

^n-i,k-j _ [k ~ J) /. „ I I ^ 1 



s, 



n,k 



{k-j)\ 
k\ 



1 + 



k-j 
1 + 



k\ 
A;" 

1 



1 + 



{k-j)\{k-jy\ kj V ■ vv k 

Distinct the cases j = and j > 1 leads to the following result 



(3.17) 
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Lemma 3.2. For i > and j > 1, we have, as n -^ oo, 






The asyinptotics of Bell numbers is more delicate and is best performed by a saddle- 
point method. It is probably the most famous apphcation of saddle-point techniques to 
combinatorial enumeration. The Bell number i?„ satisfies (see P^ Prop. VIII. 3]) 

B^ = n\ ^ "\"' ^ (1 + (e-/^)) , 

r^^27rr{r + !)€'■ ^ ^ " 

where r is defined implicitly by re'' = n + 1, so that r = (logra) ■ (log log n) + o(l). Instead 
of the latter formula, we will use the following corollary given by Salyy and Shackell |32j: 

''-^'- " ^ + i^f^^(l + o(l))V (3.18) 



Bn logn \ logn 

Note that the latter identity was established because it gives an asymptotic approximation 
of the average number of blocks in a random set partition of [n]. It is also worth noting 
that there exists a more precise asymptotic approximation (see [32]) which the reader can 
use to get more precise asymptotic approximations for the average values computed in 
this paper. 

Lemma 3.3. For any integer r ^ Z, we have, as n ^ oo, 

^- ^« ^v,,,!2Il2iii(l,„,l))|. (3.19) 



Bn \lognJ \ logn 

Proof. Suppose we are given an integer j. It follows from (I3.18P that 

Bn+i+i n+j ( loglog(n + ?') , / ss , , s 

^■'^ - -^ ' 1+ f !^^ /^ (l + o(l)) . (3.20) 



Bn^j log(n-Fj) V log(n + j 

By elementary calculus and the well-known formula log(l + x) = x(l + o(l)) as x — )■ 0, 
it is easy to establish that 

log n + J = log n 1 + O — j and — — - — — — = — (1 + oil)) , 

\ \n\.ognJ J log(ra + jj logn 

(3.21) 
and thus 



_i;±^.!iIi±i)(lH-0(^))"^^(l + 0(i)). (3.22) 

log(n + j) logn \ \n\.ognJ J logn \ \^ J J 
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Combining (13. 20 p . f l3.2ip . and fl3.22p . we obtain immediately 

i + !2sl2i!^(i + „(!)) I, 



Bn+j+1 n [^ ^ log log ra, 



Bn+j \ogn \ \ogn 

from which we deduce that, for any integer r, we have 

1 — ^ + '^n (1 + ^(1)) 

logny \ logn 

since (1 + xY' = 1 + rx + o (x) as x — > 0. D 

3.2 A process 



By Theorem I3.H in order to get the average values /i„ and n,n,k of a Z-statistic stat, we 
can proceed essentially as follows: 

(i) Compute VJ**"*(x). 
(ii) Extract coefficients from the power series V2^°'^{x) B{x) and V2^"'^{x) Sk~2{x). 

Actually, the above process seems to be nondeterministic, since it is a priori not clear 
whether V2'^°'*{x) admits a simple expression (a finite expression involving elementary func- 
tions). However, as we mentioned previously, for all Z-statistics on set partitions which 
the author was able to locate in the literature, the formal power series V2''*"*(x) admits 
the simple form (13. 9p . For such statistics, one can design a deterministic algorithm. 

Input: a Z-statistic stat such that V2'*"*(x) = Po{x) + Pi{x)e^ + P2{x)e'^^, where the 
Pi are polynomials. 

Output: (a) the exact average values /i„ and fin,k, written as finite expressions which 
involve elementary functions. Bell and Stirling numbers; (b) asymptotic approximations 
of /i„ and iJn,k- 

Step 1. Compute explicitly V2^°'*'{x) (i.e., determine the polynomials Pq, Pi and P2). 
Step 2. Using (l330D - (l332D . write Vi^''\x)B{x) and Vi^''\x:)Sk-2{x) as 

V^'''\x)B{x) = Qq{x)B{x) + Qi{x)B'{x) + Q2{x)B"{x) 
Vt\x)Su-2{x) = Ro{x)Sk-2{x) + Ri{x)Sk-i{x) + R2{x)Sk{x), 

where the Qi and Ri are polynomials. 

Step 3. Using (13.151) . write the coefficients of 1^2*^*"* (x) 5 (x) and V2*'^\x)Sk^2ix) as 



X 



n\ 



V2'"'\x)Bix) = Y,P^i.^)Bn+^ and — Vt\x)Su-2 = Yl ^^^•(^' f')Sn-^,k-J, 

i,j>0 
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where the Pi{n) and qij{n, k) are polynomials in n and k which are all zero except for a 
finite number of them. 

Step 4- Divide the expressions obtained in the previous step by Bn and S'„_fc in order 
to get the average values /i„ and fin,k, which can be written in the form 



X 



n\ 



Vt\x)B{x) = Y,P^{n)^ and 



X'- 



ni 



Vr''\x)S,.,= Y,%An,k)^^^. 






Step 5. Use Lemmas 13.21 and 13.31 in the expressions obtained in the previous step to 
get asymptotic approximations of /i„ and fin,k- 

A proof that the above process is deterministic can be easily extracted from the pre- 
vious subsection (details are left to the reader). To make things more concrete, we give 
an example in the next subsection. 

3.3 Example: an inversion statistic on set partitions 

In his study of some g-Stirling numbers which arose from a g-exponential formula, John- 
son |T7] (see also [8]) introduced an inversion statistic on set partitions, denoted here by 
inv, which can be defined as follows. For it G n„, with standard form tt = B1/B2/ ■ ■ ■ Bk, 
inv{n) is the number of pairs {i,j), I < i < j < n, such that i belongs to a block 
to the right of the block containing j. Equivalently, mv{7r) is the number of inversions 
in the RGF ^(vr) of it. Recall that an inversion in a word Wi...Wn is a pair {i,j), 
^ ^ i < j ^ n, such that Wi > Wj. For instance, there are exactly 3 inversions in 
TT = 14/25/3 = 12 312 = ^(vr): the pairs (2, 4), (3, 4) and (3, 5); hence mv{TT) = 3. Note 
that inv = 0CC21 (where 0CC21 is defined in Section 13731) . 

We propose to determine exact and asymptotic average numbers of inversions in a 
random set partition. Since inv is a Z-statistic, we must first "compute" V^'™(x). 

By definition, V2™(x) is the exponential generating function of the sequence {vn)n>o 
with Vn '■= v'^2 = S-n-en^ ^nv^n). For a combinatorialist, it is usual to interpret Vn as 
the number of pairs (vr,a), where vr is a 2-partition of [n] and a = {i,j) is an inversion 
of TT. Such a pair will be called an underlined 2-partition. An underlined partition (tt, a) 
with a = {i,j) can be identified with the restricted growth function w{tt) in which the 
letters Wi, Wj are colored. For instance, there are 3 inversions in vr = 14/25/3: ai = (2,4), 
02 = (3,4) and 03 = (3,5). Then we have (7r,ai) = 12312, (71,02) = 12312 and 
(77,03) = 12312. 

Then, using this correspondence, it is not hard to see that an underlined 2-partition 
{tt, a) can be uniquely written as 

[tt, a) = 1 ui 2 ^2 1 U3 

with ui,U2,u^ G {1,2}*. Here and in the rest of the paper, we write X* for the free 
monoid generated by an alphabet X, which is the set of words whose letters are in X. 
The empty word will be denoted e. 
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Elementary counting (there are ("2 ) choices for the positions of the underhned ele- 
ments and 2"~^ choices for the word Ui U2 U3) leads to 

0, ifn<l. 

A straightforward computation, the details of which are left to the reader, then yields 

rfi'f' / ly^ ly 1 \ 1 



n>0 



Since V^'"{x) can be written in the form (I3.9p . we can continue the process described 
in the previous subsection. 

Ste'p 2. Using f l3.10p and fl3.12p . it is straightforward to obtain 

V^''''{x)B{x) = -^B{x) + ^ (-1 + 2x - 2x^) B'{x) + ^ (l - 2x + 2x^) B"{x) 

V^J™(x)5fc_2(x) = -^{-x + x^)Sk-2{x) + i(A; - 1) (1 - 2x + 2x^) Sk-i{x) 

+ -k{k-l)il-2x + 2x^)Sk{x). 

o 

Step 3. After routine coefficient extraction based on (13.151) . we obtain 



x"" 



n\ 



Vr{x)B{x) = -i?„+2 -i^ + l) Bn+1 + (x - ^) ^" - ^^"-1 (3.24) 



n\ 



V^2™(a;)5'fe_2(x) = - {-nSn-l,k-2 + (^)25'„-2,fc-2) 



+ -{k - 1) {Sn,k^l - 2nSn-l,k-l + '^{n)2Sn-2,k-l) 

+ -k{k - 1) {Sn,k - 2nSn-l,k + 2(?2)2S'„_2,fc) . 
o 

It is possible to simplify the expression obtained for [^] V2"^{x)Sk-2ix). First, replace 
each occurrence of the left hand sides of the three identities 

Sn-2,k-2 = Sn-l,k-l ~ [k — \)Sn-2,k-l (3.25) 

'S'n-l,fc-2 = Sn,k-l — [k — l)^^-!,^-! (3.26) 

Sn-2,k = T Wn-l,k — Sn~2,k~l) (3.27) 

k 

by the corresponding right hand sides. Then, in the identity obtained from the previous 
manipulation, replace each occurrence of the left hand side of the identity 

Sn-l,k = T Wn,k ~ 5'„,-i fc-l) (3.28) 

k 
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by its right hand side. This gives 

X 



n\ 



Vr{x)Sk-2{x) = Qn (n - fc - 1) M - i j + h{k - 1) j Sn,u (3.29) 

- - (n + 1 - fc) S„,A:-1 H T] '^n-l.fc-1- 

4 4/;; 

Note that identities f l3.25p -( 13.28p could be obtained in an elementary way from (13.41) . 



Steps 4 o,nd 5. Divide the expression fl3.24p by B^ to obtain that the average number 
of inversions fin in a random set partition of [n] is given by 



IB. 



fJ-n 



n+2 



8 Br, 






{n)2 Bn-l 
4 Br. ' 



Using Lemma 13.31 it is straightforward to arrive at the asymptotic approximation 

^^ ^ 1 log log n 



n 



l^r, 



log n (log n) 



1 + o(l)) , ra -)■ oo. 



Similarly, divide expression (I3.29P by Sn^k to obtain that the average number of inver- 
sions fin,k in a random set partition of [n] into k blocks is given by 

t^n,k = \n{n~k-l)(l-\\+ h{k -l)--in + l~k) ^"''^"^ 



4-v- " '' \' k J 8 
n{n - 1) Sr^^x,k-\ 



4 



S. 



n.k 



4fc i^n,k 

Use of Lemma 13.21 then produces the asymptotic approximation 



1 / , , / 1 
(^n,k = -n (n — k — 1) \ 1 



4 
1 



hik-l)^0(n(,-l 



-n(n — k — 1){1 — — ]-\ — k(k — 1) + oil), as n — )■ cxd. 
4 \ k / 8 



3.4 Proof of Theorem [331 

Let stat be a Z-statistic defined on set partitions. It is easy to see that (13. 2p is a corollary 
of (13. 3p . Indeed, by definition of the average value, we have 



±Jr} 



Br, 



Br, 



(3.30) 



fc>i 
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Suppose that (13. 3p is true. Then we have 



f^n J-) / J ^n,k fJ'n,k j-^ / ^ 



B„ 

1 
Br, 



k>l 



X" 



k>l '- 



X 



Vr\x)Su-2{x) 



n\ 



Vr\x)[Y,Sk-2{x)]= — 



,k>l 



X 



n\ 



Vt\x)B{x). 



Therefore, in order to prove Theorem 13. ![ we just have to prove (13. 3p . Set Vm '■= v^mi- By 
the definitions of average value and product of two formal power series, Equation (13. 3p is 
equivalent to 



y^ Stat(7r) = X] f ) Sn^m,k-2 
Tgm m=0 ^ ^ 



^m.' 



(3.31) 



We will prove the latter identity. Let DISJ be the set of pairs of nonempty and disjoints 
subsets of P, i.e., 

DISJ = {{A, B}/A,BCF, A,B ^dS, A n 5 = 0}. 

Then, by definition of a Z-statistic, we have 

^ stat(7r) = Y^ Y^ stat{st{{A, B})) = ^ ^ stat{st{{A, B})) 



A,BC[n] Tren^ 
A,BeiT 



y^ Stat{st{{A, B})) Sn^\AuB\,k-2 



(3.32) 



A,BC[n] 
{A,B}eDISJ 



where the last equality follows from the fact that, for A,BC [n], the number pn,k{A, B) 
of partitions tt in 11^ such that A, 5 G vr is given by 



Pn,k{A,B) 



Sn-\AuB\,k-2, if {A B} E DISJ; 



0, otherwise. 

Now, suppose we are given a set C C P with \C\ = m. We have 

Y ?>tat{st{{A,B}))= Y stat(stc7(^)) = J^ stat(7r) = w„, (3.33) 

{A,B}<^Disj Ten2(c) 7ren2„ 

AVJB=C 

where the first equality follows from the definition of DISJ and the second is a conse- 
quence of the fact that stc send li^iC) bijectively onto 11^. 
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Combining ( I3.32p and fl3.33p . we obtain 



^ stat(7r) = ^ stat{st{{A, B})) Sn-\AuB\,k-2 

A,BC[n] 
A,B}€DISJ 

^ ^ Stat{st{{A, B})) Sn-\AuB\,k-2 

'JC[n]{A,B}eDISJ 
AUB=C 

/ , V\c\ Sn-\C\,k-2, 



Tren* A,BC[n] 

{A,B}eDISJ 



CC[n] 

which is obviously equivalent to fl3.3ip . This concludes the proof. 

Remark 3.1. Using the language of what is called the theory of species one can give a 
quick proof of Theorem \3.1\ We have preferred to avoid this terminology in order to keep 
the paper self-contained. 

4 Carlitz's ^'-Stirling distributions 

4.1 Proof of Theorem [272] 

We will work here with the Z-statistic los, which has the g-Stirling distribution S{q) 
(see [271 El])- Recall that \os can be defined for a set partition tt = Bi/ B2/ . . . jBy. in 
standard form by 

los(7r) = l^sl + 2|53| + . . . + (A; - \)\Bk\. 

We have to start by "computing" V^'^ix). 

Computation ofV^^ix). By definition, V^^[x) is the exponential generating function 
of the sequence (t^„)„>o with Vn := -wj,"! = Yl,-K<=i\il los(7r). 

For a 2-partition tt = Bi/ B2 written in standard form, we have los(7r) = |i?2|. Thus, 
we can see f„ as the number of pairs (vr, z), where vr = Bi/ B2 is a 2-partition of [tt,] (written 
in standard form) and i E B2. The number of such pairs is easily shown to be {n — 1)2"~^. 
Indeed, we have n — 1 choices for the element i (any of the elements of {2, 3, . . . , n}) and 
then 2"~^ choices for the set B2 \ {i} (any of the subsets of {2, 3, . . . , n} \ {i}). Therefore, 
we have 

'0, ifn<l; 



5Z^^"W^ W. non-2 



An -l)2^-\ ifn>2. 



A straightforward computation leads to 



vti^) = E-»^ = E(" - 1)2"- V = i + (4 + f ) ^"- '^■'' 



n>0 n>2 
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Since V2°*(x) can be written in the form (13. 9p . we can continue the process described 
in Section [221 

Step 2. Using f l3.10p and (13.121) . we can write 



Vl-{x)B{x) = \b{x) + [-| + 1] B'{x) + (l - ^ ) B"ix) 



Vt{x)Sk-2{x) = |5fe.2(a:) + ^(^ " 1) ("1 + 2a;) Sk-i{x) + h{k - 1) (-1 + 2x) Sk{x). 



Step 3. After routine coefficient extraction based on (I3.15p . we obtain 



X" 



n\ 



Vtix)Bix) 



-^B„,+2 + (I + 4) ^-+1 + ( ~| + i ) ^- 



(4.2) 



X 



T) 1 

V2°'ix)Sk^2ix) = -Sn^l,k-2 + -{k - 1) i-Sn,k-l + 2nSn-l,k-l) 

+ -k{k - 1) {-Sn^k + 2nSn-i,k) ■ 

It is possible to simphfy the expression obtained for [^] V2''^{x)Sk-2{.x) by replacing 
each occurrence of the left hand sides of identities (I3.26P and (13.281) by the corresponding 
right hand sides. This gives 



X" 



n\ 



V^"\x)Sk-2{x) = ( in(A; - 1) - hz{k - l)\ Sn,k + ^{n + l- k)Sn,k-i. (4.3) 



Steps 4 o-nd 5. Division of expression (14. 2 p by Bn gives the exact value (12. 3 p of yU„, 
while its asymptotic approximation (12. 4p is easily obtained from Lemma 13.31 

Similarly, division of expression (14. 3 p by Sn,k gives the exact value (12.51) of fin,k, while, 
after a routine computation based on Lemma [321 we obtain the asymptotic approximation 



l^n,k = -n{k - 1) - -k{k -l) + o(nll-- 



n — )■ 00, 



which is a refinement of (12. 6p . This concludes the proof of Theorem 12.21 

4.2 Some remarks 

There is an "explicit" expression for the generating function of the g-Stirling distribution 
S{q) (see e.g. [IE]), namely. 



Sn,k{l) 



W- 



EK 



k-j 



j=l 



g^^^^')[jr, 



where \2\ is the usual g-binomial coefficient. It seems difficult to recover the exact 
average values of the g-Stirling distributions from the above expression. However, it is 
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well-known that "generating functions find averages" (see e.g. [36]). In the case of our 
g-Stirling numbers, it is easy to derive from the recurrence (12. 2p that 



i)..k 



F.(.. ,) :^ E «„,(,)." ^ (i_,)(i_;j...(i_|,i^,) . (4.4) 

It is possible to recover the results presented in this section from this generating function 
identity. We can even obtain many more results, such as values of variances and some 
limit laws (see [T9]). 

5 Number of linear crossings 

5 . 1 Proof of Theorem [2731 

Computation of ¥2^ (x) . By definition, V^^ (x) is the exponential generating function 
of the sequence (t^„)„>o with Vn := v^^'J = Y^nenl cr'^'^K'^)- 

We can interpret f„ as the number of pairs (vr, C), where vr is a 2-partition of [n] and C 
is a linear crossing of vr. Such a pair will be called an underlined 2-partition. 

An underlined partition (vr, C) with C = {i,j){k,i) can be identified with the re- 
stricted growth function w{n) in which the letters Wi, Wj, Wk and Wi are colored. For in- 
stance, there are 3 linear crossings in vr = 14/2 5/3 6 7: Ci = (1,4)(2,5), C2 = (1,4)(3,6) 
and C3 = (2,5)(3,6). Then we have iTT,Ci) = 1231233, (71,^2) = 1231233 and 
(tt, C3) = 1 2 3 1 2 3 3. 

Using this correspondence, it is not hard to see that an underlined 2-partition (vr, C) 
can be decomposed uniquely in one of the following two ways: 

(a) (Tr,^) = mi12^'i211'=^2m2, 

(b) (vr,C) = t;i21^il2 2^nt;2, 

with ui = e or ui e 1{1,2}*, U2 G {1,2}* and ki,k2 > 0; Wi G 1{1,2}*, V2 G {1,2}* and 
ii,i2>0. 

Simple rules of counting and manipulations of generating functions (see e.g. [T3l [36] ) 
then show that the ordinary generating functions F^^-^x) and F^^\x) for underlined 
2-partitions of [n], n G N admitting a decomposition of type (a) and (6), respectively, 
are given by 



1-2x7 Vl-a;/ Vl-a;/ \l-2xj (l-x)2(l-2x) 



^2 



l^/l^2/^l^/l^ ^^ 

X) = \ X —- X X X 



^{b)(^^\ _ / ™ " \ ^ I ^ \ ^2 



1-2x7 \'^-^J \^-^J \l-2xj (l-x)2(l-2x)2' 

Thus, the number of underlined 2-partitions of [n], i.e., f„, is just the coefficient of x" in 
the power series 

4 
F{x) = F^^\x) + F^''\x) "^ 



(l-x)2(l-2x)2' 
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Figure 2: Linear representation of a 2-partition 



A routine coefficient extraction based on the partial fraction decomposition 

Fix) = ,, ,,,,, .,,, = x' 7-—^ + ^— + 



;i-a;)2(l-2x)2 V(l-a;)^ l-x (1 - 2a;)2 f - 2x 

gives 

^, [(n-5)2"-2 + ^ + l^ ifn>4. 

A straightforward computation then leads to 

V^ix) = E-«5 = (f - I) -" + (- + 1) e^ + ^- (5-1) 

n>0 ■ ^ ^ 

Of course, there are many other ways to compute the coefficients f„. We now present 
another way based on a decomposition of 2-partitions which is particularly well-adapted 
to the computation of the coefficients f„ in the case of circular crossings. 

Via its restricted growth function, a 2-partition can be viewed as a word on the 
alphabet {1,2} whose first letter is 1. It is then immediate to see that each 2-partition vr 
can be decomposed uniquely as 

for some k > 1, where £j, rrij > 1 for 1 < j < k, and ik+i > 0. 

For k > 1, denote by 7^ the set of 2-partitions with such a decomposition. A moment's 
thought (see Figure [2]) will convince the reader that 

• if ttgTi, crW(7r) = 0, 

.ifvrGr„A:>2,crW(vr) = |2^-'' '^ ^^^^ = ^'^ 

^ ^ \2A;-2, if4+i>l. 

It is easy to see from the definition of Tk that the ordinary generating functions Gj^ (x) 
and GI (x) of 2-partitions of [n], n G N, in Tk satisfying i^^i = and ik+i > 1, respec- 
tively, are given by G^ (x) = {jz^) and G\'{x) = (y3;) • It follows that for /c > 2 
we have 

J2 cr(7r)a;ll"ll = {2k - 3)Gi'\x) + {2k - 2)Gf\x) 
TeTfc 
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where we have set ||7r|| = n for vr G n„. 

Using the latter equahty and the formal identity X]fc>i ky'' = 7X3^) 
computation we obtain 



after a routine 



n>0 fc>2 TrSTfc 



X 



(1-X)2(1-2X)2' 



as it was previously derived. 

Since ¥2^ (x) (see (15. ip ) can be written in the form (13.91) . we can continue the process 
described in Section [3^ 

Step 2. Using dXTU]) . (IXTTD and (IXT^ . we can write 
V2^''^'\x)B{x) = -B{x) + -{9 + 2x)B'{x) + ^(-5 + 2x)B"{x) 

Q 1 1 

V2''-''\x)Sk-2{x) = y5fc_2(x) + 2 (^ - l)(-3 + 4x)5,_i(x) + -A;(A; - l)(-5 + 2x)Sk{x). 
Step 3. After routine coefficient extraction based on (13.151) . we obtain 



X 



n\ 



Vr {x]Bix] 



X 



n\ 



2 



vr {x)Sk-2{x) 



4 

3n 



-B, 



n+2 



n 9 

2 + 4 



Bn+l 



f-il^ 



— 5'n-i,fc-2 + -(^ — 1) (— SS^^jfc-i + 4nS'rt_i,fc-i) 
+ -/c(/c - 1) {-5Sn,k + 2nSn-i,k) ■ 



(5.2) 



It is possible to simplify the expression obtained for [^] V^"^'' {x)Sk-2{x) by replacing 
each occurrence of the left hand sides of identities (I3.26P and (13.280 by the corresponding 
right hand sides. This gives 



X 



n\ 



V2''''\x)Sk-2{x) = ( ^n{k - 1) - ^k{k - 1)\ Sr,,k + ^(n + 1 - k)Sr,,k-i. (5.3) 



Steps 4 o-nd 5. Division of expression (15. 2p by i?„ gives the exact value (12. 7p of yU„, 
while its asymptotic approximation (12. 8p is easily obtained from Lemma 13.31 

Similarly, division of expression (15. 3 p by Sn,k gives the exact value (12.90 of /i„,fc, while, 
after a routine computation based on Lemma 13.21 we get the asymptotic approximation 



,.,., = \n(k-l)-\i^\+oL(l-\ 



n — )• 00, 



which is a refinement of (I2.10p . This concludes the proof of Theorem 
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5.2 Some Remarks 

It is worth noting that there has been considerable interest in studying crossings in match- 
ings (set partitions each block of which has exactly two elements) and set partitions. 

In the case of the number of crossings in matchings, a remarkable formula, often called 
the Touchard-Riordan formula, was made explicit by Riordan [28], who also mentioned 
the exact values of the average and the variance of the number of crossings in matchings. 
Though they are "contained" in the Touchard-Riordan formula, it is not a priori clear 
whether they admit a simple form. Later, Flajolet and Noy [11] used a certain decom- 
position of the number of crossings to give a "direct" proof for the average value. This 
decomposition was later generalized by the author [20] to compute the average number 
of linear crossings in a random set partition and is at the origin of this work. 

Let T„ fc(g) be the generating function of set partitions of [n] into k blocks with respect 
to the number of linear crossings, i.e., Tn^kiq) = X^Trgn^ 1^^ ^'^^- Biane [1] (see also [22] ) 
found a continued fraction expansion for the generating function 'Ylin>k>o'^n,k{<l) o.^'l^"' ■ 
Recently, Stanton, Zeng and the author (see Equation 28 in [;21j) proved that 

y T^ k{q) a'e = y —, ^^^ , (5.4) 

from which they derive the following remarkable formula for Tn^kiQ) (see Equation 32 

in my- 



-..^^<^)^E<-)-i|^^/--)-(C>- 



n 
i-1 



Another remarkable formula for Tn^kil) was established recently by Josuat- Verges and 
Rubey [H]: 



T^M - (r3^EE(-')' {{,:,) (.!,) - (.,:, 



.......... , , ...... ^, .k — i 



i'V) 



Again, though the average number of linear crossings is "contained" in the above two 
formulas, it is not a priori clear whether they admit a simple form. However, it is possible 
to recover the exact average values presented in this section and even exact values for the 
variance of the number of linear crossings (see [19]) from the generating function (15. 4p . 
Moreover, a limit law will be given in [T9] . 

Finally, we want to point out that there exist combinatorial parameters which have 
the same distribution (hence, the same average value) as the number of linear crossings 
on each 11^. This is the case of the number of nestings of two arcs [22] and the major 
index for set partitions introduced in [5]. 
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6 Number of circular crossings 

6.1 Proof of Theorem [231 

Computation of ¥2^ " (x) . By definition, V^*^^ " (x) is the exponential generating function 
of the sequence (t;„)„>o with Vn := <y' = Evrena cr(^)(7r). 

Suppose we are given a 2-partition tt in Tk- By definition of Tk (see Section [5?T!) . this 
means that there exist integers ^jiTUj > 1 for 1 < j < A;, and ik+i > such that 

A moment's thought (see Figure [3]) will convince the reader that: 

• if A; = 1, cr(^)(7r) = 0, 

• if A; > 3, cr(^)(7r) = 2k. 





Figure 3: Circular representation of a 2-partition 
This implies that, for A; > 3, we have 



X \ 1 , x'^'' 



(\ 1 
^—1 = 2A;- 
1-xJ 1-x (1 



X)2fc+l^ 



vrGTfc 

since the generating function Gkix) = J^weT '''"''^' '^^ 2-partitions in % is given by Gki 

y 



(i3^) (t3^)- Using the formal identity Ylik>Q ky'^ = jTZ-y^ it is then easy to obtain the 



identity 

> cr^ -'(vrlx" " = > 2A;- tttt-t = 7 ^ 7777 zr- (6.1) 



x^^ 2x^(3 -6x + x^ 



,^ X)2fc+1 (l-2x)2(l-x)5' 

fc>3 ttGT; fc>3 ^ ' ^ J \ J 

For the case fc = 2, we have to be more careful. By definition, a partition in T2 
can be written in the form vr = \^^2'^^1^'^2'^^1^'-^ ., with £i,i2,mi,m2 > 1 and £3 > 0. 
We distinguish four cases. Instead of long-winded explanations, we prefer to present the 
results in the table given in Figure HI The first column lists the different cases considered, 
while the second column indicates the corresponding circular representations. In each case, 
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sketch of the circular 
representation 



value 
of cA'^' 



generating 
function 



case (aj 

h=i2 = 1, is 

nil = 1^2 = 1 




X 



case (b) 

4 = ^2 = 1, ^3 = 

nil + ^n-2 ^ 3 




X 

1-x 



case [c) 
nil = "1-2 = 1 




2 X 



l~x I 1—x 



case (d) 

^1 + ^2 + ^3 > 3 

nil + m2 > 3 




X \ 1 

1— a; / 1—x 



X 

l-x 



Figure 4: Circular crossings in a 2-partition in 72- 



the statistic cr^^^ assumes a constant value: the third column provides these constants. 
Finally, the (ordinary) generating function of 2-partitions in each case is given in the 
fourth column. For instance, the first and third row read as follows: 

• case (a): there is one and only one partition tt = 13/2 4, cr'^'^)(7r) = 1 and the 
generating function for this case is x'^. 

• case (c): the parameter cr^'^^ is equal to 2 and the generating function of this class 



IS X 



Y^) j^. The other rows have to be read similarly. 



Let Fi, F2, -F3 and F4 be the generating functions of 2-partitions in case (a), (6), (c) and 
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(d), respectively. It follows from a reading of the entire table that 

^ cr(^)(7r) x"-" = 1 Fi(x) + 2 F^ix) + 2 Fs{x) + 4 F^{x) = x^ + ^^ (5 -^4x + x ) 

Combining (16. ip and the latter equality, we obtain 

7ren2 fc>i ttgt; 7re75 fc>3 neTk 

4 ^ 2x^(5 -4x + x2) ^ 2x^(3 -6x + x2^ 



X^ + h-. 77 - + 



1-X)5 (l-2x)2(l-x) 



4 2x^(5- llx + 4x^ 

X + 



(l-x)3(l-2x)2 ■ 

A routine coefficient extraction based on the partial fraction decomposition 

4 2x^(5 -llx + 4x2) 5x 2 4 4 10 6 

X +^ 7^7- --TT^ = -— -2x2+X^ 



(l-x)3(l-2x)2 2 (1-X)3 (l-x)2 1-X 

1 1 

^4(l-2x)2 ~ 4(1 -2x) 

gives 

{0, if < n < 3; 

1, if n = 4; 

^2"-2 + 4n-2n2, ifn>5. 

A straightforward computation leads to 

K^'^'^' (x) = 1 xe^- + 2x(-x + l)e" + — - x^ - - x. (6.2) 

2 V ; 2 ^ ^ 24 2 ^ ^ 

Since Vg'^'' " (x) can be written in the form (13. 9p . we can continue the process described in 
Section 13. 2[ 

Step 2. Using (IXTUj) . dXTT]) and (IXT^ . we can write 

vf^\x)B{x) = — (x^ - 24x2 - 60x) B{x) + ^(-4x2 + 3x)5'(x) + -B"{x) 
Vt\x)Su-2{x) = ^ (a;' - 72x2) ^^_^(^) + (A; - 1) (-2x2 + 3x) Su-i{x) 

+ -k{k - l)xSk{x). 
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Step 3. After routine coefficient extraction based on (13.150 . we obtain 

^^ vf'\x)B{x) = ^fi„+i + y5„ -2n(n + i") fi„_i - (n)25„_2 + ^(n)45„_4 

(6.3) 

V^"" " (x)5'a:_2(x) = — {{n)iSn-A,k-2 " 72(n)2S'„_2,fc-2) 

+ (A; - 1) (-2(r2)25'„_2,fc-i + ^nSn-i,k-i) 



n\ 



1 
+ -k{k - l)nSn-i,k- 

It is possible to simplify the expression obtained for [^] ¥2^ " {x)Sk-2{.x) by replacing 
each occurrence of the left hand sides of identities (I3.28P and 

'S'n-2,fc-l = -, (5'n-l fc-l — >S'„_2.A:-2) (6-4) 

k — 1 

by the corresponding right hand sides. This gives 

^1 Vf\x)Su-2{x) = ^^^^^5„,, + ^(-4n - 1 + 5A;)^„_i,,_i (6.5) 

— {n)2Sn-2,k-2 H 77T-Sn-4:,k-2- 

Steps 4 and 5. Division of expression (16. 3p by -B„ gives the exact value (12. lip of /i^, 
while its asymptotic approximation (I2.12p is easily obtained from Lemma 13.31 

Similarly, division of expression (16. 5p by Sn^k gives the exact value (I2.13P of /i„ fc, while, 
after a routine computation based on Lemma 13. 2[ we get the asymptotic approximation 

IJ,n,k = -n{k - I) + O in^ il - -\ j, n ^ oo, 

which is a refinement of (I2.14p . This concludes the proof of Theorem 12.41 

6.2 Some Remarks 

While the combinatorial parameter cr*^^) has received a considerable interest, it seems that 
the natural parameter cr'^'^^ was ignored. It is interesting to ask whether this parameter 
admits "closed" formulas similar to the formulas for cr^^^ presented in the previous section. 
Some results on the asymptotic distribution of the parameter cr^'^^ will be given in |19] . 

7 Number of overlappings 

7.1 Proof of Theorem 12.51 

Computation ofV2^\x). By definition, V2'"\x) is the exponential generating function of 
the sequence (Wn)„>o with Vn := f^^s = Y.-K&nl'^^K'^) ■ 
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Since ovl assumes only the values or 1 on 11^, the term f„ is just the number of 2- 
partitions of [n] that overlap, i.e., the number of 2-partitions n = B1/B2 of [n] such that 
1 = TainBi < mini?2 < raaxBi < maxi?2 = n. It is easy to see that there are exactly 
2"~^ partitions tt = B1/B2 of [n] such that mini?i = 1 and maxi?2 = n, and among them 
only n — 1 do not overlap (these are the partitions tt = \^2'^~'^ with 1 < k < n — 1). 
Consequently 

v^ ,, . f2"-2-n + l, ifn>2; 
Vn= 2^ ovl [7 



TT 



rem 



0, 



A straightforward computation leads to 



Vr\x) 



^ x" _ 1 



,2x 



if n < 1. 



X 



n>0 



e- + (_a; + l)e-----. 



(7.1) 



Since V^°*''(x) can be written in the form (13. 9p . we can continue the process described in 
Section 13. 2[ 

Step 2. Using fl3:T0D . fl3lTD and fl3:T2ll . we can write 



Vr\x)B{x) 



----] Bix) 
2 4 ' ^ ^ 



~x + -1 B'{x) + -B"{x) 



4 



4 



3x 



1 



V°^\x)Sk-2{x) = -—Su-2{x) -[k-l)[x- -\ Su-i{x) + -k{k - l)Su{x) 
Step 3. After routine coefficient extraction based on (13.151) . we obtain 



x" 
n! 



V^^\x)B{x) = ^5„+2 + \b^+i -(n+-\B^- ^5„_i 



(7.2) 



a; 
n! 



3r2 1 

V2''\x)Sk-2{x) = —Sn-l,k-2 - -{k - 1) {2nSn-l,k-l - ^Sn,k-l) 

+ jk{k- l)Sn,k- 

It is possible to simplify the expression obtained for [^] V2'"\x)Sk~2{'x) by replacing each 
occurrence of the left hand side of identity (13.261) by the corresponding right hand side. 
This gives 



X 



n\ 



Vr{x)Sk-2{x) = -k{k - l)Sn^k --{n+l- k)Sn,k-l 



+ -n{k - l)^^-!,^-!. 



(7.3) 



Steps 4 o.nd 5. Division of expression (17. 2p by Bn gives the exact value (I2.15P of /i„, 
while its asymptotic approximation (I2.16P is easily obtained from Lemma 13.31 
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Similarly, division of expression (17. 3p by Sn,k gives the exact value fl2.17p of fin,k, while, 
after a routine computation based on Lemma IX^ we obtain the asymptotic approximation 

1 A\ „ / / i\" 



which is a refinement of f l2.18p . This concludes the proof of Theorem |2T 

7.2 Some remarks 

Although it seems that the parameter number of overlappings was never considered be- 
fore, it is naturally suggested by the article [12], in which Flajolet and Schott considered 
non-overlappings partitions. In particular, they obtained the generating function of non- 
overlapping partitions in the form of a continued fraction expansion. It is worth noting 
that a continued fraction expansion for the generating function of set partitions with 
respect to the number of overlappings follows painlessly from earlier combinatorial inves- 
tigations (see e.g. p^[T2]). 

We also want to point out that, as n — )■ oo, /i„fc converges to (2). That this is also 
the average number of inversions in a random permutation of size k is far from being a 
coincidence, as we will explain in future work. 

8 Number of Embracings 

8.1 Proof of Theorem [2761 

Computation ofV2"^^{x). By definition, V^^'"''(x) is the exponential generating function 
of the sequence {vn)n>o ^^^^ "^"^ •= "^n™^ = ^^Gna emb{7r). 

Since emb assumes only the values or 1 on 11^, the term Vn is just the number of 
2-partitions of [n] which are embracing, i.e., the number of 2-partitions vr = B1/B2 of [n] 
such that 1 = mini?i < mini?2 < maxi?2 < maxi?i = n, or, equivalently, such that 
mini?i = 1 and maxi?i = n. Obviously, for n > 2, there are exactly 2"~^ — 1 such 
2-partitions. Consequently 

^ /2"-2-l, ifn>2; 

^2 [O, ifn<l. 

A straightforward computation leads to 

n>0 

Since V2'^^{x) can be written in the form (13. 9p . we can continue the process described in 
Section 13. 2[ 
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Step 2. Using (IXTOj) . (I3ll]) and (13J2|) . we can write 

Vr\x)Sk.2{x) = |5fc_2(a;) - ]^{k - l)Su-i{x) + h{k - l)Su{x). 
Step 3. After routine coefficient extraction based on f l3.15p . we obtain 

- yr'(^)5(^) = 4i?n+2 - ^5n+l + ^Sn + 2^-1 (8.2) 



X 



n\ 



n \ 1 

Vr'W'^fc-2(a;) = -Sn-i,k^2 - -{k - l)Sn,k-i + -k{k - l)Sn,k. (8.3) 



Steps 4 and 5. Division of expression (18. 2p by Bn gives the exact value fl2.19p of /i„, 
while its asymptotic approximation f l2.20p is easily obtained from Lemma 13.31 

Similarly, division of expression (18. 3p by Sn,k gives the exact value (I2.2ip of fin,k, while, 
after a routine computation based on Lemma 13. 2[ we get the asymptotic approximation 

which is a refinement of (I2.22p . This concludes the proof of Theorem 12.61 

8.2 Some remarks 

It is worth noting that, like for overlappings, it is painless to derive a continued fraction 
form for the generating function of set partitions with respect to the number of embracings 
by making use of the combinatorial theory of continued fractions [10] . 

One can also consider a variation of embracings. Say that two sets B and B' strongly 
embrace each other if, with the implied order structure, min(i?) < min(i?') < max(i?') < 
max(i?). Then one can show that the statistic 'number of strong embracings' has the same 
distribution (hence, the same average value) on each n^ as the number of overlappings. 
This can be proved using the combinatorial theory of continued fractions or by a direct 
combinatorial argument. 

9 Number of occurrences of a 2-Pattern 

9.1 Proof of Theorem [221 and Theorem [2781 



Computation of V2^^'^{x). Let a = cricr2 . . . cr^ be a 2-pattern of length r. By def- 
inition, 1^2°'^'^'^ (x) is the exponential generating function of the sequence (fn)„>o with 



OCCfT 

Vn := V„o = > ,„^rr2 OCC„ TT 



^n,2 — Z^TTGm 



Z^TTg 



We can interpret f„ as the number of pairs (vr,t), where vr is a 2-partition of [n] 
and t = {ii,i2, ■ ■ ■ , ir) is an occurrence of a in tt. Such a pair will be called an underlined 
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2-partition. An underlined 2-partition (vr, t) with t = {ii,i2, . . . , ir) can be identified with 
the restricted growth function w{7i) in which the letters Wi-^jWi^,. . . ,Wi^ are colored. For 
instance, if a = 2 1 2, the 3-tuple t = (2, 4, 5) is an occurrence of a in vr = 1 4/2 3 5 and 
(7r,t) = 12212. 

Using this correspondence, it is not hard to see that an underlined 2-partition can be 
decomposed uniquely as 



TT = Ml 0"! ^2 0"2 ■ ■ ■ M-r 0"r Mr+1 if 0"! = 1, 

with Ml G {e} U 1{1, 2}* and Ui G {1, 2}* for i = 2, . . . , r + 1; 

vr = 1 t>i CTi t>2 (72 ■ ■ ■ Vr CTj. Vr+1 if CTi = 2, 

with Vi G {1, 2}* for z = 1, . . . , r + 1. In particular, this proves that Vn (and thus V^'^^'^ix)) 
depends only on the first letter of a. For i G {1,2}, let Vn equal the value of Vn when 
(Ji = i. It follows from the above decomposition of underlined 2-partitions combined with 
an elementary counting that for n > r, 

^'''='^ - Czl) T-^- + (";^) 2"-'-i, if ai = 1; 



Vn' ■ = (";') 2"-'-l, if CTi = 2. 



2^ OCC^(7r) - S| ^^(^^^2) _ (n-l\ on-r-1 
vrgna 



To see the above result in the case ai = 2, for instance, just observe that there are ("" ) 
choices for the positions of the underlined elements cri,o"2,. . . ,o"r and then 2"'"''"^ choices 
for the word f i f 2 ■ ■ ■ "^r+i- The case ui = 1 can be treated in a similar way. 

For i G {1, 2}, let ^^'''^^^(x) equal V^^^'^ix) in the case where ai = i, i.e., vi'''^'\x) = 
Sn>o'^" ^~ ^- -^^^ later simplifications, it is important to note that it suffices to deter- 
mine only one of the Vg (x)'s to get the other. Indeed, an elementary computation 
yields 

from which we deduce easily (using (19. 2p . for instance) that 

v,"-(.) + Ki"-)(.) = y: h"^" + -;r-^') ^ = E (" 2"-' ^ = 7 ^"- (9-1) 

n>0 ■ n>0 ^ ^ 

We choose to deal with V"2 i^)- In order to obtain a convenient expression for 
V^ (3^)) we need some additional materials. Let fc be a nonnegative integer and 
{n)k '■= n{n — l){n — 2) ■ ■ ■ [n — k + 1), as before. Given a formal power series A{x) : = 
J2n>o^ri^, it is easy to establish the formal identity 

J2in)kan^ = x'A^'\x). (9.2) 

n>0 
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What about the power series ^„>o(^— l)A;an^? Using the Chu-Vandermonde convolu- 



tion, we have 






from which we deduce that 



(n- l),a„^ = y^{-lf-^-x^A^i){x). (9.3) 

n>0 j=0 ■' 



In particular, for a„ = 2", we obtain 

^(n _ l)fc2"^ = ^(-l)^-i^x^2^e2- = ^(-l)'=-i^x^2-'^ ^\ (9.4) 

n>0 ■ j=0 •'' \i=0 -^^ / 

We can now express the power series V^2 (^) (and Vg (a^)) in a convenient form. 

We have seen earlier that Vn~ = ("^ ) 2"^^'~^ ioi n > r {vn^~ = 0, otherwise). After 
elementary manipulations, one gets 

n>r V / ■ ■ „>o 

For any integer r > 0, set 



Pri^y-=i2y^^'- (9-6) 



j\2r+l-j 



Combining (19. 5p and (19. 4p . we arrive at 



2T + 1 

Combining the latter identity and (19. ip . we get 



Vt'-'\x) = i-^ + Pr{x)e'\ (9.7) 



vt^'\^)=^-7S+{-Prix) + ^)e'^. (9. 



2r+l 



T! 



r/ie connecting relations. We now establish relations (I2.23P and ( 12.29^ . Instead 
of (I3.12p . we will use the identity 

e'^Sk-2{x) = S',:{x)-S',{x), (9.9) 
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which can be obtained from (13. 6p by a straightforward computation. Combining (19. ip 
and fl9.9p . we can write 



V, 



K=i), 



{x) + vi'''-^\x)] S,^,{x) 



t\ 



{Sl{x) - S',{x)) 



and, after a routine coefficient extraction based on fl3.15p . we obtain 



n\ 



Vr 



(<^1=1), 



x) + V, 



(ai=2). 



{x)] Sk-2{x) 



n 



\'^n+2—r,k >^n+l—r,k} 



Summing the both sides of this identity over all integers fc > 0, we obtain 



X" 



n\ 



v.. 



K=i), 



x) + V, 



K=2), 



n 



X) ) B{x) = ( '" ) {Bn+2^r - ^n+l-r. 



Br, 



(9.10) 



(9.11) 



It suffices now to divide expression ( 19.10p by Bn and expression (19. lip by S'n,fc, respec- 
tively, to obtain the relations (I2.23p and (I2.29p . 



Exact average values in the case o"i = 2. Combining (19. 7p and (19.90 . we can write 



V, 



K=2) 



\r+l 



Sk-lix) 



— Sk-2{X) + Pr{x)Sl{x) - Pr{x)S',{x) 



After a routine coefficient extraction based on (I3.15p . we obtain 



X" 



n\ 



vt='\x)S,^2{x) = Y1 



ly-j 

JirjK^Jj Wn+2-j,k ~ ^n+l-j,k) 



j=0 



j\2r+l~j 



-1 



ir+l 



2r+l 



'^n,k- 



A rearrangement of the terms of the sum on the right hand side gives 



X 



n\ 



V. 



K=2), 



x)Sk-2{x) 



Ti — Sn,k-2 + / ^Pj{'n)Sn+2-j,k 



2r+l 



i=o 



1 fn 



2 Vr 



»-'n+l-r,fc; 



(9.12) 



where Pj{n) is given by (I2.24p . 

Division of expression (I9.12p by Sn^k gives the exact value of /i„ ^~ given in Theo- 
rem 12.81 while the exact average value of iJ/n^~ given in Theorem 12.81 can be obtained 
easily from the exact value of /x„ ^~ by using (I3.30p . 



Asymptotic approximations of the average values in the case cri = 2. The exact average 
value of ^n~ we found is 



„K=2) 



■1) 



r+1 



2r+l 



j=0 



n 



Bn+2-j 1 (n\ Bn+l 



Br, 



2 Vr 



Bn 
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It follows easily from Lemma [3.31 and basic representations of binomial coefficients that, 
as n — i- oo, we have 



Pj{n) 



Bn+2-j (-1)' ^ [ ['^\ ,1/^ ^ ^^ Bn+2~j 



Bn 2-+!-^- vVj7 2Vj-i;; Bn 

2-J 



= (zip!!! fi + o fi) V^) Yi + (2 - ,)l2li2ii! (1 + (1))) 

= ^4!^ V (log-r-^ (i - u - 2)^^?^^ + o n.sf}^]] (9.13) 

2^+1-J j\ ^ ^ V log"- V log^ // 

for any integer j > 0. Similarly, as n — )■ oo, one can prove 

n\Bn+i-r 1 ., N^-i /^i / .xloglogn A 

n(logn) l-(r-l)— (l + o(l)) . (9.14) 



^ry i?„ r! \ logn 

Combining (I9.13P and (I9.14p . we get after basic comparisons 

(ai=2) 1 2/1 v-2 A f o^^°slogn /loglogn\ 

2r! \ logn \ logn / 

which is exactly (12.251) in the case ui = 2. 

The exact average value of /i„ ^~ we found is 

((Ti=2) _ V;^ / ^^n+2-j,k 1 / ^\ "Jn+l-r.fc {~^) ^n,k-2 



(<Ti=2) V^ / X Sn+2-j,k 1 / 

j=0 



It follows easily from Lemma [3.21 and basic representations of binomial coefficients that, 
as n — 7- oo, we have 

which is a refinement of the asymptotic approximation (I2.30p given in Theorem 

The case cxi = 1. The exact and asymptotic approximations of the average value 
IJ'n~ {l^nk~ 1 respectively) given in Theorem 12.71 (Theorem 12.81 respectively) can be 

easily obtained from the exact and asymptotic approximations oi (Xn~ ifJ'nk~ ' respec- 
tively) obtained earlier combined with relation (12.231) (respectively (I2.29P ). The details 
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are left to the reader. 

Patterns of length 2. To end the proof of our theorems, we need to estabhsh asser- 
tion {y) of Theorem 12.71 In fact, we have aheady shown in Section 13.31 that (I2.26P is 
an asymptotic approximation of /i„ (which is actually equal to /ij^'" as we remarked 
earlier). Almost the same asymptotic computation shows that fl2.26p is an asymptotic 
approximation of yUn whose exact value is given in f l2.28p (the details are left to the 
reader) . 

This concludes the proof of Theorem 12.71 and Theorem 12.81 



9.2 Remarks 

The notion of pattern containment in set partitions we have considered in this section is 
due to Sagan [31] . There is also an older and different notion of pattern containment in 
set partitions due to Klazar (see e.g. [23J). Actually, it can be shown that the statistic 
'number of occurrences of a 2-partition' (according to Klazar's definition) is also a Z- 
statistic. One can show by either applying the methodology developed in this paper or 
by direct combinatorial arguments that, with Klazar's definition of pattern containment, 
the average number of occurrences of any 2-partition vr of [r] in a random set partition of 
[n] is given by 



r B, 



n 



Mansour, Shattuck and Yan [26] have considered a restricted case of pattern contain- 
ment in set partitions where it is required that the occurrence has to be contiguous. They 
obtained some average value results for numbers of restricted occurrences of some pat- 
terns. Note that their results can not be recovered with the methodology of this paper 
since, in the contiguous case, numbers of occurrences are not Z-statistics. 

10 Concluding remarks 

10.1 Average values of Z-statistics of depth r 

There is a natural way to generalize Z-statistics. Given an integer r > 1, we will say that 
a set partition statistic STAT is a Z-statistic of depth r if for any vr G 11, 

STAT{ti)= Y^ STAT{st{Ai/A2/ ■■■/Ar)). (10.1) 

Ai,A2,...,ArG7r 

Examples of Z-statistics of depth r which can be found in the literature are the numbers 
of r-crossings and r- nestings of arcs |H], the number of occurrences of an r-pattern with 
Sagan's definition (see Section 19. 2p and the number of occurrences of an r-partition with 
Klazar's definition (see Section I 
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Theorem 13.11 can be generalized in a straightforward manner to simphfy the compu- 
tation of average values of Z-statistics of depth r. Given a Z-statistic of depth r stat, 
let w^*"* := J^ttgu^ statin), and let l^'^*"*(x) be the exponential generating function of the 
sequence «,f )„>o, i-e., 



vrix) = J2 <f ^ = E E ^'<^) ^- (10-2) 



n! ■' — ' ^ — ' n\ 

n>o ji>o vrenr; 



Then one can prove the following result. 

Theorem 10.1. Let stat be a Z-statistic of depth r. Denote by fin (respectively fin,k) the 
average value of the statistic stat in a random set partition of [n] (respectively of [n] into 
k blocks). Then we have: 



H"n,k 



1 

Bn 

1 



n! 



S, 



n,k 



n\ 



V;'^\x)B{x) (10.3) 

V:'^\x)Sk-2ix). (10.4) 



Note that the point of the above result is that the computation of average values of a 
Z-statistic of depth r in a random set partition is essentially equivalent to the computation 
of its average value in a random r-partition. 

As open problems, we ask the following questions. What are the (exact and asymp- 
totic) average values of the numbers of r-crossings, r-nestings, occurrences of an r-pattern 
in a random set partition of [n]? 

10.2 Other exponential families 

In this paper we concentrated on set partitions but we might consider any other exponen- 
tial family. An exponential family is, roughly speaking, a set of structures that are built 
out of connected pieces (see e.g. [3^ for a precise definition). One can easily establish 
a generalized version of Theorem 13.11 for exponential families to simplify computation of 
average values. We do not think that it is necessary to state the generalization, but we 
give a typical illustration of application. 

Consider a natural generalization of matchings, namely the (exponential family of) set 
partitions all blocks of which are of size m {m is a given integer > 1). For these partitions, 
we can consider the statistics 'numbers of crossings in linear and circular representations' 
and it is easy to see that these statistics are still Z-statistics on such partitions. Then one 
can prove the following results. 

• The average number jj,,J^^. of linear crossings in a random set partition into k blocks 
of size m, m > 1, is given by 




f^im") = L, m - 2 + 



(2m\ 
\ m ) 
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• The average number jjlY^^s of circular crossings in a random set partition into k blocks 
of size m, m > 3, is given by 

(k\ ( I 1 4m 

Note that, when m = 2, we recover aheady known results (see e.g. [HI [28]). 
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